Using Encyclopedic Knowledge for Named entity Disambiguation

نویسندگان

  • Razvan C. Bunescu
  • Marius Pasca
چکیده

We present a new method for detecting and disambiguating named entities in open domain text. A disambiguation SVM kernel is trained to exploit the high coverage and rich structure of the knowledge encoded in an online encyclopedia. The resulting model significantly outperforms a less informed baseline.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting and Identifying Hypertext in Wikipedia Articles

1. Ratinov, Roth, Downey, and Anderson. Local and Global Algorithms for Disambiguation to Wikipedia. (University of Illinois at Urbana-Champaign). Retrieved from http://web.eecs.umich.edu/~mrander/pubs/RatinovDoRo.pdf 2. Zhou, Nie, Rouhani-Kalleh, Vasile, and Gaffney. Resolving surface forms to Wikipedia topics. (ACM Digital Library). Retrieved from http://dl.acm.org/citation.cfm?id=1873931 3. ...

متن کامل

Large-Scale Named Entity Disambiguation Based on Wikipedia Data

This paper presents a large-scale system for the recognition and semantic disambiguation of named entities based on information extracted from a large encyclopedic collection and Web search results. It describes in detail the disambiguation paradigm employed and the information extraction process from Wikipedia. Through a process of maximizing the agreement between the contextual information ex...

متن کامل

Named Entity Linking Based On Wikipedia

In this paper, we present the ideas and methodologies on labeling the mentioned entities with the wiki dataset. This paper presents a system for the recognition and semantic disambiguation of named entities based on information extracted from a large encyclopedic collection from Wikipedia. We focus on maximizing the similarity between the contextual information extracted from Wikipedia and the ...

متن کامل

Annotating the MASC Corpus with BabelNet

In this paper we tackle the problem of automatically annotating, with both word senses and named entities, the MASC 3.0 corpus, a large English corpus covering a wide range of genres of written and spoken text. We use BabelNet 2.0, a multilingual semantic network which integrates both lexicographic and encyclopedic knowledge, as our sense/entity inventory together with its semantic structure, t...

متن کامل

Chinese Named Entity Recognition and Disambiguation Based on Wikipedia

This paper presents a method for named entity recognition and disambiguation based on Wikipedia. First, we establish Wikipedia database using open source tools named JWPL. Second, we extract the definition term from the first sentence of Wikipedia page and use it as external knowledge in named entity recognition. Finally, we achieve named entity disambiguation using Wikipedia disambiguation pag...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006